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METHOD AND APPARATUS FOR DISCARDING DATA PACKETS 
THROUGH THE USE OF DESCRIPTORS 

FIELD OF THE INVENTION 

This invention relates generally to computer networks, and more 
5 particularly, to a method and apparatus for discarding data packets through 
the use of descriptors. 

BACKGROUND OF THE INVENTION 

In the field of data routing in computer networks, an Internet service 
provider (ISP) user typically has much more stringent requirements than an 
10 enterprise user because the routers will be subjected to the adverse Internet 
routing environment in the world. There are three typical architectural 
requirements that such routers must support, described below. 

A. Stable Operation. Although it sounds trivial, the notion of stable 
operation has been elusive in the ISP community, as witnessed by various 
p 15 Internet "brown-outs" since it's inception. One paper on Internet scaling 
| "Scaling the Internet during the T3 NSFNET Years", C. Villamizar, October 

22, 1997, articulates the basic requirements which ISPs demand from their 
networking equipment in order to provide a stable network. In addition to 
forwarding performance and scaling requirements, ISPs typically expect 
20 several operational attributes, given below. 

1. Stability under adverse conditions. The router must remain 
stable and deterministic under arbitrarily high traffic loads or a flood of 

routing update changes. 

2. Low packet loss to stable destinations. The effects of unstable 

25 routes (flapping) should not impact a router's ability to forward traffic to 
stable routes. 
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3. Reasonable fairness and congestion control. Sufficient 
buffering capacity, avoidance of head-of-line blocking, advanced queuing 
algorithms, and sophisticated discard techniques must be provided. 

B. Service Differentiation. Recently it has become clear that service 
5 providers cannot make adequate margins by offering flat-rate access and 
undifferentiated service. The ability to offer tiered services, and to guarantee 
service levels, is crucial to the economic and competitive health of ISPs. The 
airline industry's first-class, business-class and coach-class offerings provide a 
meaningful analogy for Internet service differentiation: a small number of 
10 customers are willing to pay for premium service, if it can be guaranteed. The 
concentrator's must enable ISPs to offer differentiated services based on 
fl multiple queues and advanced, intelligent Traffic Management features. 

t c. Superi or Reliability. ISP routers must provide a greater level of 

S reliability and availability than known router architectures. Part of this flows 

1 15 from designing with stability in mind, but providing additional fault 

h tolerance features adds another dimension of resiliency. ISP routers should be 

2 designed without any single points of failure, and all software designs should 
% incorporate fault isolation principles. 

3 Therefore, there is a need for a way to route data in computer networks 
that provides stable operation, service differentiation, and superior reliability. 
Such an invention should be stable under adverse conditions, insure low 
packet loss to stable destinations, and provide reasonable fairness and 
congestion control. 
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SUMMARY OF THE INVENTION 

The present invention provides a method, apparatus and article of 
.manufacture for discarding a data packet. The data packet is classified 
according to a type of service (TOS) indicator and modified with an internal 
service class (ISC) indicator according to the TOS indicator. The data packet is 
edified with a watermark (WM) indicator according to the availability of a 
system resource. The ISC, WM and a drop preference (DP) indicator of the . 
data packet are compared to a committed information rate (CIS). The packet 
is discarded if the DP exceeds the CIR. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example and may be 
better understood by referring to the following description in conjunction 
with the accompanying drawings, in which like references indicate similar 

elements and in which: 

FIG. 1 shows a block diagram of an L3 card forwarding path compare 

with the present invention; 

FIG. 2 shows a buffer pool division compatMe with the present 

invention; 

FIG. 3 shows an inbound descriptor format compatible with the present 
invention; 

FIG. 4 shows a flow diagram of a method for discarding and forwardmg 
data packets compatible with the present invention; 

FIG. 5 shows a flow diagram of a method for discarding data packets 
compatible with the present invention. 



82771.P286 



-5- 



DETAILED DESCRIPTION OF AN EMBODIMENT OF THE INVENTION 

to the following description of an embodiment, reference is made to 
the accompanying drawings which form a part Hereof, and in which is shown 
by way of illustration a specific embodiment in which the disclosed 
technology may be practiced. It is to be understood that other emb— 

Jscope of the disclosed technology. An embodiment of the disclosed 
technology, described below, enables a remote computer system user to 
execute a software application on a network file server. 

The disclosed technology provides a method, apparatus and arhcle 
man ufacturefordiscardingadatapac k et. The data packet is classified 

S m odified with a watermark (WM, mdicator according to the avadabrldy of 

sysl em resource. The ISC, WM and a drop preference (DP) indicator of the 
L packet are compared to a committed .formation rate <CIR). The packet 
is discarded if the DP exceeds the OR. 
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H^rrlware Environment 

PIG 1 illustrates a data path of a packet through a hardware 
environment compatible with the disclosed technology. At step 137, packets 
153 enter theUcard!01 from the access cards 105, lOMrunk card Ul, and 

central process and resource (CPR) 103 cards. At step 139, inbound descriptor 
formation is obtained from the received packet 113 as it is transferred mto 
buf fermemoryll9havingafreebufferpoolll7. The packet type field rs 
compared with PT1 and PT2 type registers, with the PT bits set according. At 
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stepMa ,an i n b oundaesc riP to r nS i s b uUt by co— g the CPR 103 and 
LL card Ul — * one —en, a channel idennhe, — 
infor mation, and a da, packet length is passed directly to the in^und 
descriptor 115. The CPR 103 descriptor information is forwarded 
S memory 121 and the trunk card 111 in—on is forwarded to a FIFO B 

Ins At S tepl«,theinbounddescriptorl25isreadbyaro U tesw 1 tch 
memory 125. /vt^ey* / ( 

a thP help of a command interface 129 
processor (CXP) 127 and processed with the help 

.1^1 At step 145, an outbound descriptor is written by the 
and a L3 coprocessor 131. At step 

, . nr ! « At step 147, the descriptor is directed 
CXP 127 to an output queue selector 133. At step 

107 At steo 149 the outbound descriptor 
10 to the appropriate outbound queue 107. At step 149, 

destination 1S1, and freeabuffer.bufferm^^^^ 
pool 117. 

To summary the CXP 127 reads an inbound descnptor 125, 

15 i ^*^*^*" M " l ~ k * ta T d 

l00kup , buiids the packet outbound descriptor for packet —on, and 
wr Jthe outbound descnptor to the appropriate channei outboun q ueue 
V£L transmission from a DMA controller 135 to a destination 151. 



20 Buiferjvlernory 

mcoming packets 153 are transferred directly from the access cards 105, 

10 , trunk card 111, and CPR 103 cards into the L3P buffer memory 119. An 

a m emoryisdividedmto2KBbuffers20, A data packet is allocated to a b.e r 
fr0 m the 2K or 8K buffer pool as indicated by the len S th recewed m the fust 
word ofthe data packet. Any packet longer than 1536 bytes will be aUocated 
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prior to any attempt for P-per ^ of -ice (QOS, dassihcatron 
L^ino.aeHoensu.p^QOSope^pa^d^aa,^ 

5 poi nt should be avoided. ^ ^ ^ ^ 

The 4MB buffer memory 119, 201, is prerera y 

L d8 Kb yt es>ua t wmbe r eco g n i ze d b y oneo f o«s kl n m tbea rt t h a t 

othe r buffer sizes .nay be used without loss of g enera,ity. The drvtsron 
10 leen.ese.wopoo^sund.—e— Por exampfe, the enhre 

can be allocated to each pool. 

The pointers to the buffers in the 2KB pool and SKB pool are 
stained in two separate stac, memories. Each stac. memory can ho* up 
15 to8 K- 1 (or8 19 a)bufferpoin,ersanatypicaUyhasa 1 3-bitstac k pom,,T, 

1 1 to tbe location 0, it indicates that the buffer poo, is empty. The content 



pool. 
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' Topu tabufferpointeronthestac k ,thestac k pointeri S a d vancedby 

one and thenanew buffer pointer is written. To pop a buffer pointer from .the 
IcMhebuffer pointer on top ofthestachrs read and then thestac k pomter 

is subtracted by one. 
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Watermark Generation 

m one embodiment of the disclosed technology, bits 9, 8, and 7 of the 
2KB Stack Pointer are used as watermark indicator. These bits provide eight 
priority buckets as shown in Table 1 below. 
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Watermark 


Number of Buffers 


000 


0 to 127 


001 


128 to 255 


010 


256 to 383 


011 


384 to 511 


100 


512 to 639 


101 


640 to 767 


110 


768 to 895 


~ 111 


>895 
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Table 1. Buffer Pool Watermark. 

To ensure accurate and timely buffer poo! level indication.** 
watermark bitsat the time of the CXP 127 inbound descriptor Unread are fed 
mt0 the inbound descriptor 125. TheCXP 127 uses these bits to drop packets 
by freeing up buffers during periods of congestion. 

ta FIG 3, an embodiment of the first four words of an inbound 
descriptor is shown. Word 301 contains a buffer index 303, a sequence 
nu mber305,TCPflags307,awatermarkindicator309,apackettype311,a 

fiLl9,andapacket length field 321. Words 323,325,and 327 contain the 20 
byte IP header 329. Word 327 also contains a TCP/UDP source port field 331, 
and a TCP/UDP destination port field 333. 

The format and meaning of the watermark indicator 309 is given in 
Table 1 above. The watermark indicator 309 is used in conjunction wtth an 
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interna, service class (ISC) and a drop preference bit (DP) to determine when 
to drop a packet, as shown in Table 2 below. 
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Table 2. Congestion Clip Table. 

r.e disclosed technology tags an inbound descriptor 123 with the three 
bi twatermar ki ndicator309astheyarereadb y theCXPl 2 7.Thethree 

watermark bits of the watennark Indicator 309 indicate the current congest™ 
ievel of the buffer memory 119 and free buffer pool 117. As various 

HPO B memory 123 is increased by discarding lower priority data packets. 
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Qamj2 LSsmssJQ^) Classification gaMng 

to one embodiment of the disclosed technology, classification and 
policing are accomplished either implicitly from a type of service (TOS) Octet 
in the IP header 329, or explicitly as determined by filtering in specific packet 



82771 .P286 



-10- 



10 



attributes configured by the user. The result of the classification is a 3 bit (8 
level) internal service class (ISC) and a drop preference bit (DP). This 4 bit 
value is used exclusively within the disclosed technology to make QOS 
related determinations. 

The policing algorithm shown in FIG. 4 uses user programmable 
parameters to determine if the traffic rate being received has exceeded the rate 
of the traffic contract. In certain situations it is desirable to re-classify packets 
as determined from their traffic utilizations. In these cases the ISC/DP values 
may be modified during policing operations. At step 401, a lookup key is built 
using the ISC, DP and watermark values. At 403, a lookup is performed. If at 
step 405, a threshold is exceeded, at step 407 the number of octet/packet clip 
drops is counted, and the packet is dropped at step 409. Otherwise, at step 411 
the packet is forwarded to the packet destination. 



15 nnq (-nicest'™ Management 

In order to differentiate between contracted service levels, there should 
be traffic management mechanisms in place to ensure that higher precedence 
traffic has a better chance of making it through the concentrator than best 
effort traffic, especially in the case of system congestion. This system 
20 congestion is indicated by low resources in the free buffer pool 117. It is 
desirable to ensure 'premium' class traffic over 'best effort'. 

In order to make an intelligent choice on which packets to discard and 
which to keep, at least three parameters are evaluated: 1) the severity of the 
resource shortage, indicated by watermark bits 309 in the inbound descriptor 
25 125, 2) the ISC of the packet, and 3) the DP indication, a traffic rate exceeding 
its committed information rate (OR) may be subject for quick discard. The 
CXP 127 performs this evaluation by concatenating the ISC, DP, and WM 
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values into a key for a lookup into a configurable congestion clip table, an 
example of which is shown above in Table 2. The lookup will either 'hit', 
indicating the packet continues to be forwarded, or 'miss' in which case the 
packet is discarded or 'clipped'. 
5 FIG. 5 shows an example flow diagram of a method for discarding data 

packets. At step 501, a data packet is classified according to a type of service 
(TOS) indicator. At step 503, the data packet is modified with an internal 
service class (ISC) indicator according to the TOS indicator. At step 505, the 
ISC is compared to a committed information rate (CIR). If at step 507 the ISC 
10 exceeds the CIR, the data packet is dropped at step 509. 

It is desirable that in these congested scenarios that the lesser 
K precedence packets can be dropped at a rate fast enough to allow for higher 

^ precedence packets to continue to be forwarded and also to drain the inbound 

1 descriptor queues (FIFO A memory 121 and FIFO B memory 123) in order to 
£p 

in 15 replenish the free buffer pool 117. 

h While the invention is described in terms of preferred embodiments in 

2 a specific system environment, those of ordinary skill in the art will recognize 
J that the invention can be practiced, with modification, in other and different 
^ hardware and software environments within the spirit and scope of the 

20 appended claims. 
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